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Abstract 

Social systems are in a constant state of flux with dynamics spanning from minute- 
by-minute changes to patterns present on the timescale of years. Accurate models of 
social dynamics are important for understanding spreading of influence or diseases, 
formation of friendships, and the productivity of teams. While there has been much 
progress on understanding complex networks over the past decade, little is known 
about the regularities governing the micro-dynamics of social networks. Here we 
explore the dynamic social network of a densely-connected population of approximately 
1000 individuals and their interactions in the network of real-world person-to-person 
proximity measured via Bluetooth, as well as their telecommunication networks, online 
social media contacts, geo-location, and demographic data. These higlr-resolution data 
allow us to observe social groups directly, rendering community detection unnecessary. 
Starting from 5-minute time slices we uncover dynamic social structures expressed 
on multiple timescales. On the hourly timescale, we find that gatherings are fluid, 
with members coming and going, but organized via a stable core of individuals. Each 
core represents a social context. Cores exhibit a pattern of recurring meetings across 
weeks and months, each with varying degrees of regularity. Taken together, these 
findings provide a powerful simplification of the social network, where cores represent 
fundamental structures expressed with strong temporal and spatial regularity. Using 
this framework, we explore the complex interplay between social and geospatial behavior, 
documenting how the formation of cores are preceded by coordination behavior in the 
communication networks, and demonstrating that social behavior can be predicted with 
high precision. 
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Introduction 


Human societies, their organizations, and communities give rise to complex social dynamics 
that are challenging to understand, describe, and predict. Recently, network science has 
provided a powerful mathematical framework for describing the structure and dynamics of 
social systems Si- With deep roots in traditional sociology [I], 5], a central challenge in 
the description of social systems is understanding social group behavior. Using empirical 
data, such groups (or communities) have recently been shown to be highly overlapping and 
organized in a hierarchical manner 12,3,6 J8j. Without an understanding of the fundamental 
meso-level structures and regularities governing social systems, modeling and predicting 
behavioral patterns remains a challenge |4j[5]. 

While a coherent mathematical framework is not yet in place, existing research suggests 
that social dynamics are far from random. For example, the existence of strong regularities 
for individuals in human populations has been well documented within mobility patterns & 
9 , and in social systems pair-wise interactions show clear patterns occurring at multiple 
timescales from seconds to months 116j. For groups of interacting individuals, however, an 
understanding of the fundamental structures and their temporal evolution across timescales 
has proven elusive so far, suggesting a potential for a better understanding and models 
describing important processes such as spreading of influence or diseases, formation of 
friendships, or productivity of teams. 

Our work is based on a longitudinal (36 months) high-resolution dataset describing a 
densely-connected population of approximately 1000 freshman students at a large European 
university [lO|. We consider interactions in the network of physical proximity measured via 
Bluetooth (see methods), complemented with information from telecommunication networks 
(phone calls and text messages), online social media (Facebook interactions), as well as 
geo-location and demographic data. 

Until this point, community detection in dynamic networks has required complex 
mathematical heuristics 01 121. Here we show that with high-resolution data describing 


social interactions, community detection is unnecessary. When single time slices are shorter 
than the rate at which social gatherings change, communities of individuals can be observed 
directly and with little ambiguity (Fig.[l^). Using a simple matching between time slices we 
can infer temporal communities. These dynamic communities offer a powerful simplification 
of the complex system of social interactions as it develops over time. 

Below we describe these findings in detail. We then study the properties of gatherings 
and cores, show how appearances of social cores are preceded by increased communication 
among their members, and examine how cores tend to behave as social units. As an 
application of the framework, we show that the social dynamics of this population are 
highly predictable. 


Results 


Social interactions unfold on many timescales, with structures and regularities spanning from 
minute-by-minute changes to yearly rhythms and beyond, as observed in telecommunication 
networks 16 or online social networks |20|. Despite these inherent dynamics, most of the 
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current understanding of social networks comes from the study of static network topologies, 
not including temporal aspects [lj2|. Human social communities are inherently overlap- 
with each and every individual participating in multiple communities. 


13,22 


pmg 

In time-aggregated networks, this structural property results in communities with more 
outgoing than incoming edges, the ‘hairball’ structure visualized in Fig. la (green network). 
With minute-by-minute observation intervals of this network, individuals are constrained to 
a single community per time slice, and we can observe the basic structural elements directly 
and with little ambiguity, as illustrated in Fig. & (blue network). More generally, when 
time slices have a duration which is shorter than the rate with which the group composition 
changes, gatherings of individuals can be observed directly, rendering traditional community 
detection algorithms redundant (the short time-scale time slices, Fig. & (blue network) are 
strikingly different from random subsamples of equal size from the daily networks, see SI). 
Matching connected components across time slices, we can study the temporal development 
of social gatherings. In our dataset, gatherings are typically together for a few hours (full 
statistics available in SI), and can be thought of as instances of social structures {cores) 
that occur repeatedly over days, weeks, and months. 


Gatherings 

In the physical proximity network, meetings are require all members to be present at the 
same time, and can exist between multiple individuals. This implies that gatherings can be 
directly identified as graph components (consisting of individuals in close physical proximity) 
within each 5-minute time slice, Therefore, we are able to use a simple hierarchical clustering 
to match groups across time slices (see SI), using a matching strategy is similar to Ref. 

A node may be part of only a single gathering per time step, but over time individuals 
flow in and out of social gatherings, shifting their affiliation, and forming new gatherings 
as illustrated in Fig. E>- The gatherings display broad distributions in both size and 
duration, capturing meetings ranging from small cliques to large aggregations, and from 
short interactions on the order of minutes to prolonged encounters lasting many hours, 
covering a wide range of meeting types. Gatherings are defined for any number of nodes 
greater than one, but since we are interested in group dynamics, we only discuss gatherings 
of size three or greater in the statistics reported here. Since our cohort consists of university 
students, an important inhomogeneity in the data is between ‘work’ activities that take 
place on campus (including scheduled classes) and ‘recreational’ activities that take place 
off campus. Utilizing GPS information, we find that 42% of gathering take place on-campus 
(work) and 58% take place off-campus (recreation). Comparing work/recreation statistics, 
recreational gatherings tend to be smaller but last considerably longer, illustrating that the 
context of meetings can influence their properties (see SI for full statistics). 

The fluid behavior illustrated in Fig.flja results in ‘soft’ gathering boundaries, with some 
members participating for the total duration of the gathering and others participating only 
briefly. Peripheral members can be acquaintances briefly interacting with members of the 
social core, but can also be spurious connections in the data: nearby strangers eliciting a 
Bluetooth measurement not corresponding to a social connection. In spite of these soft 
boundaries, we find that gatherings are characterized by a stable core of individuals that 
are present during the majority of each meeting (see SI). This stable core is expressed in 
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participation profiles as illustrated in Fig.fl];. Here, a participation profile is the sorted 
fraction of time each node has participated in a particular social context, normalized by 
its total lifetime. A pronounced core structure implies a gap in the participation profile, 
separating the core members (Fig.[lJ;, left, dark gray bars) from the peripheral nodes 
(Fig#, left, light gray bars). We test if the gap is statistically significant by comparing 
to an ensemble of profiles generated by a random process (Fig.[iJ, right), where a random 
participation level between 0 and 1 is assigned to each node from a uniform distribution. 
Based on the ensemble, we estimate the average expected gap size and deviation. If the gap 
observed in the empirical data is greater than the average null-model gap Random plus one 
standard deviation cr ran( j om , we accept the core as significant. According to this criterion, 
we find that 7146 out of the 7 320 (97.6%) inferred communities display a pronounced core 
structure. 

Cores 

A gathering represents the time evolution of a single meeting between a group of individuals. 
In most cases, however, gatherings are an instance of a lasting social context (e.g. a group 
of friends or class-mates), and we observe the same nodes participating in subsequent 
gatherings occurring repeatedly over the following days, weeks, and months. We call the 
social structures corresponding to all gatherings of the same set of individuals cores. We 
argue below that cores represent a fundamental structure expressed in dynamic social 
networks with strong temporal and spatial regularity. Below we discuss cores. How cores 
are identified, and how cores can be used to quantify the regularity of social interactions, 
as well as predict future behavior of individuals in our dataset. 

In this population, the number of appearances per core is a heavy-tailed distribution; 
most cores appear only a few times, while the most active cores can appear multiple times 
per day over the full observation period (see SI). Here we focus on the temporal patterns 
of recurring gatherings, restricting our analysis to cores of size three or greater that are 
observed more than once per month, on average. The split between activities that occur 
on campus is important for cores as well as gatherings, and Fig.[2^ shows the difference of 
how individuals engage and spend time in different social contexts (work cores —primarily 
observed on campus—and recreational cores —primarily observed elsewhere). While the 
distribution of number of recreational cores per person is broad, most participants are part 
of only one or two recreational cores (Fig. [2^,, top panel). The distribution of work cores 
per person is localized with an average of 2.74 ± 1.85 work cores per person (presumably 
corresponding to classes and study groups). 

We find that cores leave traces in other data channels that emphasize the differences 
between work and recreational meetings. One such trace is coordination behavior, which we 
can explore by investigating how call and text-message activity increases in the time leading 
up to a meeting. We define a core’s level of coordination ct at time t as the average increase 
of activity of its members prior to a meeting. Specifically, we set ct = 1/N X)n=i a ?/ a ti 
where N is the number of participants, a” is the individual activity of person n in time-bin 
t (indexing the hours-of-the-week), compared to an individual baseline which is simply the 
person’s average activity a” in that hour of the week; to generate the curves in Fig.[2|we 
then average over cores. We observe clear evidence of coordination prior to meetings, with 
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Figure 1: Properties of gatherings, a, The social network at different timescales. The network 
formed by face-to-face meetings within one day (green), 60-minute (orange), and 5-minute temporal 
aggregation (blue). In the 5-minute time-slices groups are directly observable without much ambiguity, 
but the overlap between groups increases as time is aggregated across bins, b, Illustration of gathering 
dynamics. Gatherings evolve gradually with members flowing in and out of social contexts, c, 
Extracting cores from participation profiles. Dark gray bars denote nodes with participation levels 
above the maximal gap. Ordered participation profile for a empirical data (left), as a null model we 
use participation profiles generated from a uniform random distribution (right). 
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Figure 2: Cores summarize social contexts for individuals, a. The distributions of work 
and recreational core memberships, the top panel shows that individuals typically participate in only 
one or two recreational contexts, although the tail of the distribution contains individuals with more 
gregarious behavior. Inset shows the distribution of participation in cores overall (both work and 
recreation), b, Coordination prior to meetings defined as increase of phone calls and messages 
occurring within a hourly time-bins prior to the meeting relative to a null model based on the 
average hourly telecommunication behavior for each participant. More coordination is required to 
organize meetings during weekends than during weekdays. Larger meetings do not require additional 
coordination per participant. 


a stronger effect during weekends (Fig. @>, upper panel). This result explicitly shows the 
extra coordination cost associated with non-schedule driven interactions. Conversely, the 
coordination cost per participant does not depend on the size of the gathering in question 
(Fig .J2]}, lower panel), suggesting a social optimization process takes place (since the number 
of social connections grows as n(n — l)/2 for a group of n participants). 

Modeling the social network 

In order to place the significance of the basic statistics regarding gatherings and cores in 
perspective, we create a null model for the full network dynamics (see SI for full details). 
We model each time-slice as a random geometric graph (RGG), and generate dynamics by 
representing each node as a random walker, similar to models known from the literature [5]. 
For reasonable parameter values, we find that this dynamic RGG model creates gathering¬ 
like components, and is able to recreate qualitative features of the distribution of both 
gathering sizes and lifetimes. Thus, as suggested in the literature [H], we are in fact able 
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to model key statistical features of gathering-occurrences across a single day, using a very 
simple model. 

The fundamental difference between our empirical system and the dynamic RGG model 
arises because the model does not capture correlations between groups of nodes. This 
means that the dynamic RGG model does not generate recurring gatherings, and is unable 
to model dynamics across weeks and months. The fact that gatherings in the dynamic 
RGG model only occur once means that empirical statistics on the level of cores are not 
reproduced by the model. 


Cores are social units 


It is well established that two individuals who share a social tie have correlated mobility pat 
terns 15 18 . Because interactions with social ties tend to happen at irregular intervals 28 


using this correlation for location prediction has proven difficult. Expressed differently, we 
know that the location of person A is going to be predictive of person B’s location (and 
vice versa ) when they meet, but we rarely know at which point in time A and B will meet. 

As we have defined them, cores represent meetings between n > 2 individuals, character¬ 
ized by the fact that all n members are usually present when the core is active. Therefore 
an observation of an incomplete set of core members implies that the remaining members 
will arrive shortly, a fact which can be used for prediction. We call this the property of 
cores the ‘social unit’ attribute. 

We illustrate the social unit attribute using cores of size three. Given that two members 
of a core are observed, we measure the probability that the remaining member will arrive 
within one hour. To avoid testing on scheduled meetings, we only consider weekends and 
weekday evenings and nights (6pm-8am), where meetings are not driven by an academic 
schedule. Further, we test on a month of data that has not been used for identifying cores. 
We compare the behavior of actual cores to two null models, both based on an aggregated 
graph of daily interactions. 

The first null model (random), simply illustrates that our results are not driven by 
spurious co-locations. In the random null model, we create an aggregated daily graph 
for each day in the observation period. We then choose a random day and pick three 
individuals at random. If two of those individuals appear in the same location, we measure 
the probability of the third node arriving within one hour (including the reference group 
in the statistics only if at least two of its nodes are co-located during the day, see SI). 
This situation is illustrated using blue nodes in Fig. [3^,. The random null model shows no 
predictive power (Fig.JsJa). 

For the second null model (BFS) we test the effect of pairwise friendships within groups 
of three. Here, we form reference groups using a breadth first search (BFS) strategy. We 
choose a random day and based on a randomly chosen seed node, we perform BFS steps 
until the neighborhood-set is large enough to form a group of size three; then we create 
a reference group based on the seed and two random nodes from the neighborhood. This 
situation is illustrated in Fig. [3^., where the BFS starts from a yellow seed node expanding 
to two randomly chosen neighbors marked in green. We then choose situations where 
two nodes from this set are subsequently co-located and test how often the third BFS 
reference-group member arrives within the next 60 minutes (see SI). The BFS model implies 
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at least pairwise relationships between the (yellow) seed node and the the two other (green) 
group members. 

Comparing the performance of cores with the two null models (Fig.JdJr) we find support 
for the social unit attribute. The random null model clearly shows that spurious connections 
do not carry a signal, and the BFS model illustrates that even if all three nodes form a 
connected subgraph, those subgraphs have a less than 10% probability of the third member 
arriving within the hour. In contrast, the cores capture arrivals of the remaining member in 
close to 50% of cases in spite of cores having soft boundaries. These observations provide 
support for, and quantify, the social unit attribute. Cores require all members to be present. 

Ego perspective: Core participation forms social trajectories 

From the perspective of an individual, we find the situtation displayed in Fig.[3j3, where 
each ego is involved in a number of overlapping and nested cores; the distribution (number 
of memberships) is shown in Fig[2^i, inset. Figure [3]i shows when cores are activated across 
our observation period from ultimo January to ultimo April, with time running horizontally 
and each row/color corresponding to a core, with colors matching Fig. [3^; we call the core 
instantiation profile an individual’s ‘social trajectory’. 

Replacing the complex temporal dynamics of the social network with the sequence 
of participations in the core instantiation profile offers a powerful simplification of the 
complexity of a dynamic social network. Since each core represents a social context the set 
of an individual’s cores provides a finite set of states (a ‘vocabulary’) for quantifying their 
social life. 

The social trajectory shown in Fig.[3]d also provides an important connection to research 
within human mobility. Based on mobility data, it has recently been shown that human 
mobility patterns are regular, and that given a sequence of location-observations, a person’s 
geographical location in the next time-bin can be predicted with high accuracy (an average 
of 93% of the time) |:8]. Interestingly, the problem of predicting the next instantiation 
of a social core is equivalent to predicting the next location in a sequence of locations, 
once we summarize the dynamic network of social interactions using the social trajectory 
formulation. In the analogy, a social core corresponds to a location in physical space, and 
instantiating a gathering corresponds to visiting that location. This implies that we can 
use the methods developed in |8] to estimate an upper bound for the predictability of social 
trajectories. The upper limit of predictability is derived from the amount of repetition 
(routine) encoded in a sequence of observations. 

The central property needed to estimate the bound on predictability is the time-correlated 
entropy S, which quantifies the amount of uncertainty within a data sequence, accounting 
for frequency and ordering of states. The upper bound on predictability, II, can then be 
determined by applying a limiting case of Fano’s inequality [l9j (see SI). 

Since we have access to participants’ location traces as well as their networks, we can 
calculate the upper bound on predictability for both social trajectories and location traces 
across the population. In order to incorporate the full complexity of social interactions 
in the calculation of the time-correlated entropy, we include cores with any number of 
appearances, as well as cores consisting of two nodes in these calculations. In Fig. [3^, we 
display the distributions of time-correlated entropy and corresponding upper bound on 
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Figure 3: Cores predict social behavior, a Example of an aggregated graph of daily interactions, 
illustrating cores and construction of null models. Blue nodes correspond to the random null model, 
yellow/green nodes illustrate the BFS null model, and red nodes show a three-core. See main text 
for full details, b Percentage of predictive groups in each category: Cores, BFS reference-groups, 
and random groups. Reference model errorbars are calculated across n = 100 independent trials, c, 
Example ego view of communities; we observe overlapping and nested structures, d, The temporal 
instantiations of the cores in c. Time runs horizontally and each row corresponds to cores (colors 
matching c). e, The distributions of time-correlated entropy (S) and predictability (Tl) for social 
and location patterns. We find that overall social patterns tend towards lower entropy than geospatial 
traces, resulting in higher predictability. 
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predictability for the population. Comparing the two distributions, we find that the typical 
social behavior is characterized by lower entropy and thus higher predictability than the 
typical mobility behavior. In the dynamic RGG model, which we use as a null model for 
the full network dynamics, the temporal entropy does not converge; this implies that there 
is no social predictability in this model (see SI). 


Comparing the empirical mobility behavior to the literature 18,30J, we find a lower 
average predictability limit of approximately 80% (Fig.[3^i). The fact that the location- 
predictability is somewhat lower than the 93% previously reported [8] is connected to a 
number of factors: the non-representative socio-demographics of the study population may 
play a role, the GPS location data used here has significantly higher spatial resolution 
than the cell towers used in previous work |8,30|—a fact which is known to decrease the 
estimated predictability 1201), and we focus on predicting the next location in a sequence 
of locations rather than predicting the person’s location in the next time-bin—a more 
challenging prediction task (see SI). 

The overall level of social and geospatial predictability is not correlated for individuals 
(p -value = 0.85, see SI). In spite of this lack of correlation we know that the predictability 
encoded in social or geospatial trajectories is closely related to daily and weekly schedules 32 


A deeper exploration of the connection between social and geospatial behavior across the 
week provides an example of how the social trajectories enable a joint socio-spatial description 
of an entire population. 

First we consider the typical behavior by calculating the uncorrelated entropy (see 
methods) for the social and geospatial trajectories independently. As a measure of disorder, 
the entropy quantifies how unpredictable a pattern is: the higher the entropy, the more users 
tend to distribute their time between many distinct states. Figure [4^, shows the entropy 
averaged over the population for each day of the week, in 8-hour bins. Nights running 
from midnight to 8am, days spanning 8am to 4pm and evenings from 4pm to midnight. 
Light colors indicate high entropy (complex behavior) and dark colors low entropy (simple 
patterns). 


The geospatial behavior (Fig. upper panel) is characterized by low entropy on 
weeknights, essentially corresponding to sleeping in one or two locations. The entropy is 
high during weekdays and assumes mid-range values during most evenings with a strong 
exception on Friday and Saturday nights, which are characterized by the highest entropy of 
the entire week. These ‘party nights’, are consistently the time-bins with highest average 
geospatial entropy, corresponding to exploration behavior. 


The social behavior (Fig. [4^,, lower panel) is similar to the geospatial behavior across 
most of the week: Predictable nights, varied days, and evenings somewhere in between. 
On Friday and Saturday night, however, the social trajectories display behavior that is 
significantly different from the collective pattern arising from the geospatial trajectories. In 
these time-bins, when the study participants are most exploratory in a geospatial sense, they 
appear to be highly conservative in social sense, displaying simpler and more predictable 
social behavior. This finding is consistent with Fig. [2^,, which shows that a large majority 
of participants focus their off-campus life on a small number (one or two) social cores. 

The observations above suggest that during the week, the population is characterized 
by a pattern of the same people meeting in the same places, with exploration peaking 


10 





on Friday nights and weekends. On Friday night and weekends when the population is 
most unpredictable in their geospatial behavior, they are highly predictable in their social 
behavior, exploring a range of locations, but always with the same core of friends. 

In interpreting this result, it is important to realize that we only observe the social 
interactions among participants of the experiment. While the geospatial data-stream is 
sampled evenly over the observation period, the social stream has a potential bias, e.g. it is 
possible to go out with non-university friends on Fridays and weekends. We can address 
this caveat by considering the behavior of cores across the week (Fig.JdJ}). In Fig. [4^ we 
consider the behavior of cores across time, displaying the uncorrelated location entropy of 
‘core location histories, i.e. the sequence of locations visited by each core (averaged and 
binned as above). In interpreting the absolute values of entropy in Fig.[4}3, note that cores 
typically only meet in a few locations; thus, in turn cores have fewer location states than 
individuals, resulting in smaller average values for the entropy. 

Figure]!] shows directly that cores do display a distinct exploratory behavior on Friday 
nights (and weekends more generally). The fact that geospatial exploration occurs as part 
of a social group, but constrained to certain time-bins reveals a complex interplay between 
time, location, and social context, and supports the hypothesis that at times when humans 
are most unpredictable in the geospatial domain, they display predictable social behavior. 


Discussion 


The freshman participants of this study are not a representative sample of society as a 
whole, and we expect that some aspects of the findings presented here reflect that this is a 
particularly youthful demographic sample (e.g. geospatial entropy peaking on Friday and 
weekend nights). While the population studied here is not representative of society as a 
whole, we argue that the methods developed, as well as many of our findings do generalize 
to more representative populations. Below we summarize the most important of those 
findings. 

It is often the case that the mathematical description of networks grows much more 
complex once the temporal dimension is included 14], with even basic network properties, 
such as degree, clustering coefficient, or centrality having multiple competing generalizations 
to the temporal domain. Here, we observe the opposite effect, that additional temporal 
information can simplify the description By observing the network at high temporal 
resolution we can directly observe gatherings, a phenomenon which is obscured in time- 
aggregated networks [3]. In another context 113 , data on empirical network flow has been 
used as extra information in order to uncover overlapping communities. A simple matching 
across time slices reveals dynamically changing gatherings with stable cores that provide 
a strong simplification of the social dynamics. These cores leave traces in other data 
channels, for example eliciting coordination in the telecommunication network. The cores 
are fundamental units in the sense that they require all members to be present, a fact which 
we can use to predict the future location of other (non-present) members. 

Connecting our findings to the literature of dynamic community detection, we note that 
many elegant methods exist in the literature that would allow us to detect gatherings in a 
daily network |2, ll][l2,2ljj23 , but here we have chosen to use a simple matching of graph 
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Figure 4: Temporal aspects of geospatial and social predictability, a, Uncorrelated entropy 
respectively for location (upper) and social (lower) states, averaged across all individuals. Showing 
how unpredictable individuals are within 8-hour bins over the course of a week, b, Geospatial entropy 
of cores, indicating when cores, i.e. individuals together in a social setting, explore new geospatial 
locations. 
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components to underline that social structures in individual snapshots are so obvious that 
these sophisticated methods are unnecessary. None of the methods cited above, however, are 
designed for detecting communities at multiple temporal scales; therefore they are unable 
to identify cores and help us understand their patterns over time. 

Using an individual’s cores as a set of social states, we can define a social trajectory 
which simplifies the description of their social activity, allowing for an exploration of the 
predictability encoded in social routine. We find that, for individuals in our population, 
times of exploration in the geospatial domain are connected to a small subset of social 
circles, suggesting a general hypothesis that when humans are most unpredictable with 
respect to location we are most predictable with respect to our social context. 

In summary, our work provides a first quantitative look at the rich long-term patterns 
encoded in the micro-dynamics of a large system of interacting individuals, characterized 
by a high degree of order and predictability. The work presented here provides a new 
framework for describing human behavior and hints at the promise of our approach. While 
we have focused on predictability, we expect that our work will support better modeling of 
a multitude of processes in social systems, from epidemiology and social contagion to urban 
planning, organizational research, and public health. 


Materials 

The dataset 

The proximity interaction network of participants from the Copenhagen Networks Study 
based on Bluetooth scans, collected using smartphones. Physical proximity measured via 
Bluetooth corresponds to distance between 0 and approximately 10 meters, depending on 
environmental conditions. As false positives (reported observations of users not actually 
present) in Bluetooth scans are unlikely, we use a symmetrized (undirected) network of 
interactions. 

Informed consent 

Data collection was approved by the Danish Data Protection Agency, and informed consent 
has been obtained for all study all participants. 

Definitions of entropy 

For an individual i given a sequence of states we define entropy, or uncertainty, in two ways, 
(1) uncorrelated entropy S) unc = — Pj l°o 2 Pj > where p 3 is the probability of observing 

state j, captures the uncertainty of the behavioural history without taking the order of visits 
into account; (2) temporal (or time-correlated) entropy 5 temp = _ „ p(T-) log 2 [p(T/)], 

where p(T') is the probability of finding a subsequence T[ in the trajectory 7j, takes 
both frequency and order of states into account. From the entropy one can estimate 
the upper bound of predictability by applying a limiting case of Fano’s inequality [8j[l9|: 
Si = H{Ui ) + (1 - II,;) log 2 (-/V - 1), where #(11*) = -11,; log 2 (IU) - (1 - II,;) log 2 (l - Ifi) 
and N is the number of states observed by person i. 
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Fundamental Structures of Dynamic Social Networks 

Supplementary Information 



SI Summary of main results 


Figure [ST| illustrates our main findings. Social groups display a complex temporal behavior, 
with dynamics spanning multiple time-scales. Typically, incorporating the temporal dimen¬ 
sion drastically complicates the mathematical description of complex networks such that 
community detection methods require sophisticated mathematical heuristics to disentangle 
the web of interactions. By observing social interactions at the right time scale—when 
the temporal granularity is higher than the turnover rate—we can directly observe social 
gatherings. Figure[ST^i-c shows social networks obtained using three temporal-windows of 
increasing size. While daily and hourly windows of aggregation obscure social relations 
(Fig.[Slfc,-b), a micro-level description directly reveals the fundamental structures. Applying 
a simple mathematical matching scheme across time-slices reveals dynamically evolving 
gatherings with soft boundaries and stable cores (Fig. SO- Unlike the typical community 
detection assumption of binary assignment, it is clear that some members participate for the 
total duration of the gathering, while others only participate briefly (Fig. |Slf i). Matching 
cores across longer time-scales allows us to observe dynamics that unfold over weeks and 
months. Cores provide a strong simplification of the social dynamics (Fig. |Sl^ ), and are 
manifested throughout other data channels such as coordination behavior via call and 
text messages. To demonstrate the saliency of our description we use the social contexts 
provided by cores to quantify the predictability of social life and give a proof of concept of 
a new type of non-routine prediction. 



Figure SI: Summary of main findings, a-c, Network slices obtained by slicing the social dynamics 
using varying temporal windows (1 day (green), 1 hour (orange), and 5 minutes (blue)). Below, 
adjacency matrices colored in agreement with networks, and sorted according to component size, 
d, Gatherings have soft boundaries. The size of each node represents the level of participation, e, 
Cores simplify social dynamics and provide a context for social interactions. 


A note on correlations induced by temporality 


It is instructive to consider the magnitude of the correlations induced by temporality. We 
investigate this question as follows. Let us say that the network in Fig. [STf has E c edges. 
We now generate a random network based on Fig. [UK where E c edges are sampled at 
random. Now we compare the original time-slice (Fig. |Sl| c:) with this reference network. 
Fig. S2 summarizes the key differences. Figure |S2|A and B provide a visual comparison of 


the networks, which strikingly illustrates how far from a random sample of the aggregated 
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Degree Clustering 


Figure S2: Fundamental difference induced by temporality. A. A snapshot of social interactions 
within a 5-minute temporal bin, with E edges. B. Three reference networks constructed by randomly 
sampling E edges from the daily interaction graph (Fig 1A, main manuscript) . C. Degree distribution 
for the network in A and 1000 random references networks constructed as described in B. D. 
Clustering distribution. Sampling E edges from the daily network produces mainly one big component 
with low probability of observing high degree nodes and with low clustering as compared to the 
temporal snapshot. 


network each temporal network slice is. Subfigure A shows the original network from 
Figure Sic with E c edges, and Fig. S2 shows three examples of subsamples of E c edges 
sampled from the network in Figure SI a. We see that whereas the original time-slice has 
many medium sized components with high clustering, the resampled networks form a single, 
sparse component with low clustering and a few isolated dyads. The basic network statistics 
confirm how radical this change is. Figure [S2|C shows the change in degree distribution, and 
Figure [S2p quantifies the change in the clustering-distribution as we go from the highly 
clustered real-world data to the tree-like random subsamples. 


The remainder of the SI document is organized as follows. Section 2 describes the 
dataset, Section 3 explains the details of how gatherings are constructed, and describes their 
basic statistics, including a discussion of dyads. Section 4 shows how cores are extracted and 
goes into detail on their temporal behavior, as well as the sub-core structures. In Section 
5 we provide full details regarding the routine-based geospatial prediction as well as the 
social context prediction presented in the main text. Section 6 provides background on our 
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model for purely social prediction and, finally, Section 7 discusses background information 
regarding the coordination leading up to meetings. 


S2 Data 


We consider a dataset from the Copenhagen Networks Study. It spans multiple years 
and measures with high resolution: physical interactions, telecommunications, online 
social networks, and geographical location. In addition, the dataset contains background 
information on all participants (personality, demographics, health, politics). These data 
are collected for a densely connected population of approximately 1000 students at a large 
European university. Data is collected by running custom built applications installed on 
1 000 smartphones (Google Nexus 4). Full details can be found in Ref. 


10 


In this manuscript we focus on detecting and tracking co-located groups of individuals 
during a representative period of five months (roughly one semester), collected between 
January 1st and June 1st of 2014. The Bluetooth sensor collects proximity data (~ 0 —10 m) 
of the form ( i,j, t, s ), where each interaction implies that person j has been in proximity of 
person i at time t, where the devices observe each other with signal strength s [24]. Bluetooth 
scans do not constitute a perfect proxy for face-to-face interactions. In fact multiple scenarios 
exist where people in close proximity do not interact and vice versa, nevertheless Bluetooth 
can successfully be applied in order to sense social networks [10, 24,25], Further, our 


gathering/core-description naturally filters our spurious connections by considering social 
structures that occur over across extended periods of time. Gatherings and cores are 
identified in the proximity network, and the remaining communication channels are used 
for validation purposes. In addition, we reserve proximity data from the month of May 


for validation purposes. Table [SI] shows statistics across the various data sources for 814 
individuals, on whom we focus due to their high data quality. 


Data source 

Total 

Unique 

Bluetooth interactions 

14 673 869 

154 818 

Call & text interactions 

75 364 

1216 

Geographic locations 

18 603 072 

- 

WiFi access points 

1663483 977 

2 412 702 


Table SI: Data overview from January 1st - June 1st. Bluetooth and call & text logs are summarized 
for within-participant relations and do not include external interactions. The unique field denotes 
the number of distinct observation of each quantity, e.g. number of uniquely observed links. 


S2.1 Construction of temporal network slices 

The data collection application triggers Bluetooth scans every five minutes from the time a 
phone is turned on; for this reason, the collection of sensors does not follow a global schedule. 
To account for this behavior we divide all temporal information into absolute time-windows, 
A minutes wide. Within each temporal bin, we draw a unweighted undirected link between 
two individuals if either i has seen j or vise versa. 
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S2.2 Selection of time-scales 


The scanning frequency of the application sets a natural lower limit of the network resolution 
to 5 minutes, however, there is no such upper limit for aggregation. So-called natural 
timescales have previously been investigated for specific networks with respect to their global 
topological properties [26 - 28j. Here we consider the correlation between slices (or turnover 
of nodes between slices) defined as C = \E^ n E^_ 1 \/\E^ U E ^_ l |, where denotes the 


set of edges that are observed in bin i with given temporal width A. According to Fig. S3 


the average correlation decreases sharply as function of window-size, achieving maximum 
correlation for small bins. Slicing network dynamics into short slices (high resolution) also 
disentangles the network (Fig. SI), thus when time slices are shorter than the group’s 
turnover rate, we can directly observe individuals’ group affiliations. Based on Fig. |S3| we 


chose a temporal width of 5 minutes, but windows of 10 minutes could have been chosen 
without deterioration of results. 



Figure S3: Average correlation between network slices, averaged across all time-bins. Inset shows 
a closeup for the smallest bin-sizes. Calculated for proximity data form March 2014. 


When we consider the duration of time-windows, there are two aspects to consider, a 
methodological perspective and a practical perspective. From the methodological perspective, 
using any finite time window shorter than 10 minutes (as argued in Fig. S3), will produce 
analyses and results that are consistent with what we have presented in the main text. 
Using a resolution finer than 5 minutes could potentially be interesting if one was interested 
in measuring the precise dynamics of group formation (answering questions such as: what 
is the precise sequence of arrivals?, does a certain group member consistently arrive before 
everyone else? etc.) or group dissolution (e.g. someone leaving 30 seconds before the rest 
of the group could be an important social signal). 

Considering the practical perspective, it is clear that measuring e.g. once every mil¬ 
lisecond would produce 300 000 times more data than we are currently collecting, while 
adding very little extra information about the social system. For the type of dynamics 
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considered in the main text (e.g. individuals moving from one gathering to another) 5 or 10 
minute time-bins arguably present an useful trade-off between measurement accuracy and 
the meaningful network changes. 


S3 Gatherings 

In this section we describe how connected components in the proximity network are matched 
across short timescales into dynamical ensembles, which we denote as gatherings. We then 
present fundamental statistics on gatherings (size distribution, durations, stability, start/end 
times), and analyze these properties in the light of on/off-campus behavior. Finally, we focus 
on identifying repeated gatherings across longer timescales to infer dynamical communities. 


S3.1 Detecting gatherings 


In each temporal slice we identify connected components, i.e. nodes that are in close 
physical proximity as social groups. Since dyadic relationships qualitatively differ from 
group relations 11 ,29 1 we generally treat components of size two separately. 

A gathering is defined as a group that is persistent across time. To identify gather¬ 
ings we apply agglomerative hierarchical matching, a widely used method, that merges 
groups based on their mutual distance (defined below) [30,31 , using a matching strategy 


similar to Green et al. 14 . Each group is initially assigned to its own cluster, then every 


iteration-step merges the two clusters with smallest distance according to the single linkage 
criteria (min(dj(ct, Cf)))- This merge criterion is chosen because it is strictly local and will 
agglomerate clusters into chains, a preferable effect when clustering groups across time. 
The clustering process is repeated until all groups have been merged into a single cluster. 
Distance between groups is calculated using a modified version of the Jaccard similarity: 


d{c t ,c t >) = 1 - | /(At, 7 ), 

Q U Cf/\ 


(SI) 


where /(At, 7 ) is a term that denotes the coupling between temporal slices and At = t! — t 
denotes the temporal distance between two bins (for consecutive bins At = 1). The function 
can assume any form, increasing or decreasing; we utilize it to model decay between temporal 
slices, with the two most prominent forms being: exponential (exp (—7 (At — 1))), and 
power-law (At~ 7 ), see Fig. S4 Thus, by definition the term assumes the value 1 (zero 


decay) between two consecutive temporal slices. For computational reasons we only focus 
on gatherings identified using exponential decay with 7 = 0.4, other decay parameters yield 
similar results see SI Section IS3.1.2I 


S3.1.1 Partitioning the dendrogram 

The method described above iteratively constructs a dendrogram where temporally localized 
groups are hierarchically clustered. To extract meaningful social structures we need to 
partition the dendrogram (Fig. |S5|). Modularity and partition density have previously been 
applied for similar purposes j;3j,'32j, but these do not generalize well for temporal processes. 
Instead we consider the cluster stability with respect to local and global measures. Local 
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Figure S4: Temporal coupling between two temporal slices as function of the decay parameter 7 . 
By definition the decay is zero between consecutive slices (At = 1). a, Exponential decay, b, Power 
law decay. 


stability ( 77 ) is calculated from the average node-wise overlap between consecutive slices 
(Fig. [S6}i.) . while global (a) is calculated from the average overlap between all slices and 


the aggregated structure (Fig. S 6 a): 


^death 1 

Y J(st,9t+i) 


t —^birth 

(S2) 

^death ^birth 1 

^death 


Y 


t = ^birth 

(S3) 

^death ^birth 


where fbirth and fdeath are respectively the birth and death of the gathering, g t is a temporal 
slice, G = (/birth U < 7 birth+i U • • • U (/death is the aggregated structure, and J is the overlap 
(J = \i fl j\/\i U j|), defined as zero if the gathering has only existed for one time bin. Palla 
et al. [21 applied a related measure to estimate the stationarity of communities. Varying the 
partition threshold (Fig. |S6| ;) we observe a maximum in both measures, indicating a regime 
where gatherings are both temporally and globally stable. Threshold values of d > 1/2 are, 
however, problematic since they merge gatherings that split into two equally sized parts 
together with both parts, or vice versa. In this scenario, we find that the desirable behavior 
is to declare the old gathering as ‘dead’ and identify two new gatherings as born. Therefore, 
to achieve optimal stability and to avoid issues with unwanted merging we partition the 
dendrogram at d = 0.49. 


S3.1.2 Temporal decay function 

Here we investigate how robust the inferred gatherings are to perturbation of the 7 -parameter 
and using an alternate decay form. To compare two set of gatherings (identified using 
different 7 -values) we calculate the average maximal overlap between individual gatherings 
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Figure S5: Illustration of constructed dendrogram, depicting distance (d) between groups identified 
across 5 timebins. The tree is constructed using an exponential decay function with 7 = 0.4. Two 
gatherings, orange and blue, are inferred by thresholding the tree, where all groups below or equal to 
the threshold (d = 0.4,) are merged. 



C 



Figure S6: Stability mea¬ 
sures of gatherings, a, Il¬ 
lustration of the local sta¬ 
bility measure, calculated 
between consecutive slices, 
b, Global measure calcu¬ 
lated between each slice and 
the aggregated structure, c, 
Global (a) and local (rj) sta- 
tionarity of gatherings as a 
function of partition value 
d, averaged over all gath¬ 
erings identified in Jan¬ 
uary 2014- Gatherings 
achieve optimal stability 
around d ~ 1/2. 
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where G 7 denotes the set of gatherings found using a specific value of 7 , and |G 7 | is the 
number of gatherings. Overlap is calculated using Jaccard similarity, J = \i D j\/\i U j\, 
where i and j are respectively gatherings from G 7 and Gy. Figure S7 shows the overlap 
matrix between identified sets of gatherings; in general it assumes overlap values above 
0.76 for any choice of parameters. Indicating that the gatherings are robust to even large 
perturbations. 



Figure S7: Effect of de¬ 
cay parameter on gather¬ 
ing robustness, calculated 
using Equation \Sf\ Color- 
bar shows average overlap 
between two sets of gather¬ 
ings, and never drops below 
0.76, indicating the robust¬ 
ness of the procedure with 
respect to parameter pertur¬ 
bations. 


S3.1.3 Gathering timescales 


The outlined method identifies multiple gatherings, some only exist momentarily while 
others are sustained for long time periods. One can easily imagine brief encounters between 
good friends as being more meaningful than prolonged interactions between individuals 
commuting to work; this therefore raises the question of which meetings are meaningful 
and which are not. 

Here we simply adopt the convention developed by the Rochester Interaction Record 33 , 
where meaningful encounters are defined as those lasting 10 minutes or longer. In order 
to filter out spurious connections, we impose the requirement that a gathering must be 
observed for at least 4 consecutive time slices to be represented in our statistics. 

While dynamics on 20 minutes+ timescales describe the overall evolution of a gathering, 
micro dynamics on 5-minute scales represent everyday events such as going to the bathroom. 
Gatherings, therefore, might disappear and reappear within very short time-intervals 


(Fig. |S 8 ^t), in such cases we use imputation, see Fig. S 83 . 
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Figure S8: Gathering micro fluctuations, a, A gathering appears, disappears, and re-appears 
moments later, b, If this occurs in rapid succession we treat the gathering as if is was present in bin 
ts with the same nodes as in bin t 2 • 


S3.2 On- & off-campus gatherings 

Gatherings are not geographically constrained and therefore free to occur anywhere, in this 
section we focus on distinguishing between gatherings that occur on- and off-campus. We 
do this by applying data acquired through the WiFi channel, where each mobile phone 
scans for nearby wireless network access points (AP) every 5 minutes or less and logs their 
unique identifier and name. The entire university campus is densely covered by wireless 
networks and it is therefore highly unlikely that students located on campus will not see an 
university AP. Because the names of campus APs follow a uncommon and standardized 
naming scheme we can infer when a student is close to a campus access point and use 
this as a proxy of being on or off campus. For each participant we construct a 5-minute 
binned vector containing binary values 0 (not on campus) and 1 (on campus). Because 
gatherings are an ensembles of people we perform majority voting across nodes for each 
time-bin to determine whether the gathering was on campus or not in that specific time-bin. 
Since gatherings are spatio-temporal entities, we also perform a majority voting across 
all time-bins to achieve a hard split and determine whether a gathering mainly occurred 
on- or off-campus. This yields 13 872 off- and 9195 on-campus gatherings and with an 
92.84% average agreement between votes. The primary reason for disagreement are mobile 
gatherings traveling to or from campus (e.g. on a bus or walking). 


S3.3 Gathering statistics 


Gatherings show a broad distribution in both size and duration, see Fig. |S9[ Dividing 
gatherings into on- and off-campus categories reveals that meetings occurring on university 
campus are larger (Fig.|S9p), but have considerably lower probability of lasting longer than 4 


hours (Fig. S91). This suggests that large meetings are mainly driven by the class schedule, 


while meeting duration is determined by social context. Because meetings occurring outside 
of campus often require increased coordination, our hypothesis is that once groups meet, 
they will interact across longer periods of time for the meeting to pay off with respect to 
the organizational cost. 

It is also interesting to consider the duration of each meeting as function of the total 


number of nodes that participate in it. Figure [SlOp . shows broad distributions of duration 
across all sizes, however, both the mean and median are quite stable and reveal that small 
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Size Duration (hours) 

Figure S9: Statistical features of gatherings. Summarizing size and duration distributions for 
all gatherings (panels a-b) and conditioned on off/on-campus meetings (c-d). a, Gathering size 
distribution, b , Gathering duration distribution, c, Gathering size distribution for on- and off-campus 
meetings, d, Gathering duration distribution, based on location (on/off-campus). 
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gatherings on average have shorter durations compared to larger meetings. Further dividing 
the data into on- and off-campus categories (Fig. S10 | j) shows that small meetings on and 
off campus are quite similar with respect to duration, while larger meetings tend to last 
longer, provided that they occur off-campus. 


a 



b 



3 5 10 15 20 


Size 


Figure S10: Duration as function of gathering size, a, Violin plot shows the distribution of durations 
as function of size, summarized across all gatherings, b, Box plot of the duration distributions 
divided into on- and off-campus meetings. 


Combining the statistics above with Eqs. ( |S2|S3[ ) we can explore how gathering stability 
depends on size and duration of meetings. Figure [STTf i reveals that the local churn between 
consecutive time slices is quite constant, irrespective of size, indicating that there is a low 
turnover of nodes between slices—on average much lower than predefined by the partition 
threshold. For small gatherings, however, we observe finite size effects due to the partition 
threshold (see sec. S3.1.1). Global stability is lower, but also fairly independent of gathering 


size. With respect to duration (Fig. SI l| b) , local stability increases as meetings duration 
increases, revealing that longer meeting have lower turnover of nodes between consecutive 
slices. The global measure shows similar behavior. It achieves a slightly lower stability and 
shows that gatherings are globally stable independent of duration. This combination of 
a constant turnover between slices and a global stability suggests that gatherings contain 
groups of highly interacting individuals, that are present throughout the entirety of the 
meeting, while other individuals participate infrequently and are constantly being replaced. 
Similar social structures have previously been observed in the social science literature where 
the individuals have been defined as core members [34] . 

Finally we can look into specific temporal patterns with regards to when on and off 
campus gatherings occur, i.e. in which 5 minute time-bins they first appear and later 
disappear. Figure S12 1 reveals that on-campus gatherings have increased probability of 
occurring exactly on the hour, while off-campus meetings are evenly distributed—clearly 
showing the effect of the class schedule. A similar case is seen for the probability of dissolving 
(Fig. |S12|j), where on-campus meetings mainly end on integer hour values. 


S3.4 Temporal communities 

So far each gathering only contains information about its local appearance, to gain a 
dynamical picture we match gatherings across time. Due to soft boundaries, a strict 
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Figure Sll: Global and local gathering stability, a, Average stability as function of size, averaged 
across all gatherings with a specific size. Full lines denotes mean, while shaded areas shows the 
standard deviation of the mean, b, Average stability as function of duration, binned into one hour 
wide bins. Fully drawn lines denote the mean, while shaded areas illustrates its deviation. 



Time of hour Time of hour 

Figure S12: Summary statistics of gathering temporal patterns, summarized for on- and off-campus 
meetings. Red line denotes the uniform probability distribution for the case where all states are 
equally probable, a, Probability of a gathering appearing, calculated from the first time-bin in which 
we observe it. b, Probability of dissolving, i.e. the last time-bin where we observe a gathering. 
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matching criteria is not a feasible method, since a person who coincidently walks past a 
group might be included in it. Thus, we expect noise to be present in each gathering. To 
mitigate this effect we instead match gatherings according to the participation levels of their 
constituent nodes. Counting the fraction of times a nodes has been present in the gathering 


we construct a normalized participation profile, see Fig. S13 r. Again, gatherings are matched 
according to their individual participation profiles using agglomerative hierarchical clustering. 
Since nodes, however, no longer assume binary values but may assume participation levels 
in the interval 0 < n* < 1, we calculate distance based on a continuous version of the 
Jaccard similarity: 

N 

52 mil i{Gi,Gj) 

n =1 

~N 5 

52 max (Gi, Gj) 

n=1 


D(Gi,Gj) = 1 - 


(S5) 


where Gi is a vector containing node-wise participation values for gathering i, and N is the 
total number of nodes in Gi U Gj. The two functions max and min act piecewise on the 
two vectors, and D(Gi,Gi ) is defined as 1 between two gatherings that have zero overlap, 
When merging clusters of gatherings ( Q ) we apply the average linkage criterion and define 
the average distance between them as 


D(g,g') = 


i 


\g\\g’\ 


E E D ^ G \ 

G&g G'eS' 


(S6) 


where \Q\ denotes the cardinality of a set of gatherings. Other linkage criteria can also 
be used, such as complete or Ward-linkage. Iteratively this method builds a dendrogram 
with gatherings as leafs. Thresholding the tree partitions similar gatherings together into 
communities. A community then consists of all nodes from its constituent gatherings, but 
it also inherits their individual participation profiles. Thus we need a method to construct 
a community participation profile from its subcomponents. This can be done using two 
methods: weighted or unweighted, with the differences illustrated in Fig. S13| n The 
unweighted method takes into account the gathering participation profiles and calculates 
the average, weighing each gathering equally: 


1 




(S7) 


Geg 


The weighted version instead assigns each gathering a weight according to its lifetime (riif e ), 
i.e. number of temporal bins it has been present: 


^weighted_ 


i 


52 Flife,( 


E r life,GG'- 


(S8) 


c&g 


Both measures comparatively construct similar dynamical communities and yield similar 
overall statistics; we choose the the weighted version, because it is slightly less influenced 
by noise. 
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Figure S13: Illustration of gathering participation profiles, a, Each vertical bar indicates the 
overall fraction of time a node has spent in the gathering. Values are normalized according to the 
total gathering lifetime, b, Illustration of how to construct a community participation profile based 
on its constituent gatherings. Top figures depict two gatherings with unequal lifetimes. Bottom plots 
illustrate the principles behind the unweighted and weighted methods. Bar colors indicate the role 
each node plays in the community profile. 


S3.4.1 Optimal clustering partition 


Applying hierarchical clustering to merge similar gatherings into communities again leaves 
one open question: Which partition value is the optimal? Using a similar line of reasoning 
to what we presented in in Sec. S3.1.1 we can argue that a threshold value of D = 0.49, is 


preferable. It is, however, possible to estimate the exact optimal threshold value, which we 
can compare to our hypothesized guess. Applying a heuristic inspired by the Gap statistic 
introduced by Tibshirani et al. [35], we compare the clustering according to a null model 
distribution (for a comprehensive survey of methods see Milligan and Cooper 136 j). Given 
a total of m gatherings clustered into k clusters (communities): C\, C 2 , ..., C& we calculate 
the within-dispersion measure as, 


w >‘ = Y.*rr\ £ D <f 


i 2|Ci 
r=l 1 


i,jECr 


(S9) 


where Dij is defined in equation S5 and denotes the pairwise distance between gatherings i 
and j that both belong to cluster C r . Again | • | denotes the cardinality of a cluster, i.e. 
the number of gatherings that are clustered in C r . The factor 2 takes double counting 
into account. Thus Wk is the accumulated within cluster sum of differences around the 
cluster mean. Applying the principles from Tibshirani et al. [35] we compare log(Wfc) to 
an expected value generated by a null model distribution of the data. The gap measure is 
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defined as 


B 

Gap(fc) = 1/B J2 log (W kb ) - log(W fc ), (S10) 

6=1 

where B denotes the number reference data sets. The optimal number of clusters is then 
the value of k for which log(Wfc) falls furthest below the reference data curve, i.e. the value 
of k = k* such that 


k* = argmin{fc|Gap(fc) > Gap(fc + 1) — Sfc+i}, 

k 


(Sll) 


where s = Sfc y 7 (1 + 1/5) and s & is the standard deviation of log (Wkb) over the B synthetic 
datasets. Each null model is constructed by assigning random participation values, chosen 
from a uniform distribution, to random nodes, thus creating reference gatherings with 
similar size distributions. According to Fig. S14 the gap statistic achieves a minimum, 
indicating that the optimal place to cut the dendrogram is at distances of D = 0.50, in 
good agreement with the previously stipulated value. A threshold value of D = 0.50 is, 
however, problematic since it will cluster gatherings with 50% overlap. To avoid this issue 
we cut our dendrogram at D = 0.49, merging 23 067 gatherings into 7320 distinct dynamic 
communities. 
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Figure S14: Estimating the optimal number of clusters using the gap measure, with the number of 
clusters k being directly related to the threshold value D. 


S3.5 Dyadic relations 

Dyads are a special case of social relations and are hypothesized to be qualitatively different 
from groups (T, 29 , but the way we identify them is similar to the outlined framework in 


Section S3.1 Since dyads can only be components of size two, tracking their evolution 


is trivial, because we can apply a strict merge criterion, i.e. require 100% overlap. This 
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also applies in the case when tracking repeated appearances (dyadic cores) across the full 
duration of the dataset. Thus, they only distinction between a gathering and a dyadic 
gathering lies in the fact that size of the gathering will always be fixed, meaning if a dyad 
evolves into a group (of any size larger than two) then we claim that the dyad in question 
has ended and a new group relation has been initiated. In accordance with gatherings, we 
require dyadic gatherings to be present for at least four consecutive bins (20 minutes) to be 
considered as meaningful. In total we observe 34996 dyadic gatherings and 4844 unique 
dyads. 


S3.5.1 Dyad statistics 

Similar to group relations, dyadic gatherings produce a broad distribution of durations 
comparable to the gatherings of groups (Fig. |S15^ .). In addition, dyads also produce a 
broad distribution of repeated appearances, see Fig. S15 | j, with a majority of gatherings 
only being observed once, while others on average can appear multiple times per day. 




Figure S15: Statistical features of dyadic gatherings, a. The distribution of duration for dyadic 
gatherings, b , The distribution of the number of appearances. 


S4 Cores 


In the previous section (see Fig. S11 | j) we found that certain individuals are present for a 
majority of a gathering’s lifetime. In this section, we begin by describing how such cores 
are extracted from temporal communities. Then, we describe the fundamental statistics of 
cores (distribution of appearances, distribution of sizes, individual membership in cores, 
subcore-statistics), as well as the key differences between work- and recreational cores. 
Finally, we study the temporal patterns and entropy of core-appearences. 


S4.1 Extracting cores 

Nodes within each community have varying attendance (see Fig. S13^ ), some are only 
members for a limited time, while others interact over extended periods of time. Thus, 
participation profiles show pronounced core structures, highlighting individuals that act as 
‘generators’ of each community. 
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Consider the participation levels of individuals as ordered profiles (Fig. S16i), where 


each bar denotes the fraction of time a node has has spent in a community relative to 
the community’s total lifetime. A significant gap in this profile identifies core nodes. We 
compare this to a participation profile generated by a random process (Fig. S16 | j), where 
we pick a random participation level between 0 and 1 (for each node) from a uniform 
distribution. The maximal gap in this random profile thus tells us whether the real gap is 
significant. We can estimate the average expected gap size and deviation by generating 
many (N = 10 000) random participation profiles. Generalizing this notion to all sizes of 
communities we evaluate how significant a gap is compared to the expected value generated 
at random. The decision boundary in Fig. |S17 divides gap sizes into two regions. If the 
actual gap is greater than the average null-model gap /Random plus one standard deviation 
<7random> we define the core to be significant. Thus, we only keep cores with gap sizes above 
Mrandom + <7 ran dom- According to this criterion, we find that 7 146 out of the 7 320 (97.6%) 
inferred communities display a pronounced core structure. 
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Figure S16: Extracting cores from community participation profiles. Dark gray bars denote nodes 
with participation levels above the maximal gap. a, Ordered participation profile for a community 
composed of 15 individuals, b, Similar as in panel a but generated from a uniform random distribution. 


S4.2 Core statistics 


Cores have a broad distribution of the number of appearances (Fig. S18^ i), ranging from cores 
appearing only once to on average occurring multiple times per day. The size distribution 
is also heavy-tailed (Fig. S18 } j). 

To produce meaningful temporal statistics we henceforth focus on cores which on average 
are observed more than once per month, this limits our focus to individuals that appear in 
these. Fig. S19 shows the distribution of the number of cores each individual is part of; it 
is a broad distribution with a majority of individuals partaking in few cores while a small 
minority of users are extraordinary social and have more than 10 cores. 
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Figure S17: Core selection boundary. De¬ 
cision boundary calculated from N = 10 000 
independent trials for each size. Black line de¬ 
notes the mean max gap value while error bars 
indicate the standard deviation. 




Figure S18: Core statistics, where we focus on cores of size greater than two. a, Probability 
distribution of the number of appearances per core, b, Size distribution. Inset shows raw numbers 
for specific sizes. 



Figure S19: Distribution 
of the number of cores per 
individual. Calculated for 
users that appear in fre¬ 
quently observed cores, i.e. 
cores that on average are ob¬ 
served more than once per 
month. A minority of indi¬ 
viduals partake in more than 
10 cores, with the distribu¬ 
tion being centered around 
a mean value o/~ 5. 
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S4.3 Subcores 


Because cores span a wide range of sizes small cores can appear as subcores embedded 
within larger ones, see example in Fig. |S20[ r. In fact, our methodology allows for and 
identifies highly overlapping and hierarchically stacked structures. We define a core to 
be a subcore if, and only if, it is fully contained in the larger one. Figure S20 | j shows a 
broad distribution in the number of subcores that are contained within individual cores, 
with a majority of cores only containing one subcore, while other can contain more than 
ten. There is of course a dependence on size, such that bigger cores have larger probability 
of containing more subcores. Figure S20: quantifies this phenomenon by considering the 


fraction of subcores that cores of size s contain. 



Figure S20: Subcore structures and statistics, a, Illustration of overlapping and hierarchically 
stacked core structures, b, The distribution of the number of subcores contained within individual 
cores, c, Fraction of cores that contain 0, 1, 2, 3, 4, and 5+ subcores (sc) as function of core size. 


S4.4 Work &; recreational cores 


In Sec. S3.2 we determined for each gathering whether it occurred on or off campus. 


For cores this distinction is not possible since cores consist of multiple gatherings that in 
principle can occur both on or off campus. Instead we count the number of times each 
core appears on and off campus and perform a majority voting. Cores that have a higher 
frequency of on-campus meetings are denoted as work cores, while we call cores that appear 
more frequently off-campus as recreational cores. In case of a tie, we label the core as 
recreational. Figure S21 1 shows the voting schedule and indicates the split, while Fig. S21| j 
shows the waiting time probability between consecutive meeting events. For work cores 
the waiting time probability shows clear signs of daily and weekly patterns, suggesting 
that these cores may be driven by the class schedule. Recreational cores on the other 
hand exhibit a more subtle pattern that slowly decays and is considerably higher during 
nighttimes—suggesting that two fundamentally different mechanisms drive the activity 


of the two groups. Splitting the number of cores per individual (Fig. S19) up into the 
two categories yields the results shown in Fig. S21 >d, where users have a broad degree 


of recreational cores, on average participating in 2.53, while the number of work cores is 


more localized with an average value of 2.74. According to Fig. S2l3-f individuals spend 
more time in recreational cores than work cores, clearly depicting that context has great 
influence on the properties of meetings. 
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Figure S21: Distinction between work & recreational cores, a, Excerpt of voting scheme depicting 
the number of on- and off-campus meetings for each core. Gray line splits the area into work (above) 
and recreation (below) categories, b, Inter event time distributions between consecutive meetings , 
aggregated across all cores of similar class (work/recreation). Events are hourly binned and full lines 
denote moving averages, calculated using 4-hour windows, c, Number of work cores per user. d. 
Number of recreational cores per user, e, The distributions of how much time individuals, in total, 
spend in cores with work and recreational context. The average individual spends approximately 
38 hours in a work setting, and 120 hours in a recreational context during the period of study, f, 
Cumulative probability distribution of data shown in panel e. 
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S4.5 Meeting regularity 


Each core has a specific temporal pattern linked to it denoting the periods of time it has 


been present, see Fig. S22i for an example. The information contained in each pattern can 
be quantified using Shannon entropy [37], defined as 

(S12) 


H = - ^pt log 2 p t , 


where the sum runs over all temporal bins t and pt is the probability of observing a specific 
core within given time-bin. For each core we aggregate its meeting patterns across the full 
study duration into weekly 1-hour bins, then within each bin we calculate the probability of 
observing the core. Entropy is calculated individually for cores using Eq. S12 According to 


Fig. S22}} there is clear differences between the meeting patterns for work and recreational 
cores. Further, the entropy distributions (Fig. S22;) reveal that recreational cores, on 
average, have higher entropy and thus lower meeting regularity—indicating that they do 
not meet within pre-scheduled temporal-bins. 





Figure S22: Meeting regularity of cores, a, Example pattern for a single core, denoting the 
probability of observing the core. Data is aggregated across all weeks into 1-hour wide bins, b, 
Aggregated meeting patterns across all cores, showing the probability of observing work and recreational 
cores, c, Distributions of meeting time entropy calculated across all cores and divided up into work 
and recreation categories. 


S4.6 Ego viewpoint 

So far we have mainly focused on the overall structural and dynamical features of cores, but 
we can reverse the perspective and look at cores from the perspective of individuals. From 
this perspective, each core provides a social context for the situation in which a person is 
embedded, whether it is in a work, recreation, or other relation. Figure |S23^ , shows the 
involvement of a representative individual that is involved in multiple cores, producing 
hierarchically nested and overlapping structures. The corresponding temporal pattern of 
cores (Figure [S23 [ j) reveals a complex behavior. 

S5 Predicting behavior from routine 

In this section we describe the detailed analysis leading up to routine-based geographical 
location and social context prediction presented in the main text. 
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Figure S23: Ego-centric perspective of cores for a representative individual, a, Network perspective, 
revealing overlapping and nested structures, b, Temporal dynamics of individual cores, colored 
accordingly. 


Song et al. have used entropy, an information theoretic measure in order to estimate 
the upper bound of the predictability of individuals’ mobility patterns |!8||. We argue here, 
that in analogy to geospatial behavior, human social life can be described by a temporal 
sequence of ‘social states’. These can be used to quantify the predictability of social life. 

Given a sequence of states for an individual i we can define entropy in two ways. First 
we can think of predictability in a temporally uncorrelated sense with entropy defined as 


Ni 


ST = -Vft-logaft-, 


/ . J 

j 


(S13) 


where pj is the probability of observing state j and iVj is the total number of states observed 
by person i. Eq. |S13 captures the uncertainty of your location history without taking 
the order of visits into account, thus discarding information contained in the daily, weekly 
and monthly sequences of behavior. A more sophisticated measure that includes temporal 
patterns is temporal entropy: 


s tem P = _ J- p(T')log 2 \p(T')}, 


(S14) 


T'cTi 


where p(T[) is the probability of finding a subsequence T[ in the trajectory T*. From the 
entropy one can estimate the upper bound of predictability (II,:) by applying a limiting case 
of Fano’s inequality [8,19,38]: 


Si = H(Ui) + (1 - IF t ) log 2 (N - 1), (S15) 

where H(Ui) = -IT log 2 (IT) - (1 - IT) log 2 (l - IT). 

S5.1 Comparing with previous studies 

In the main manuscript (Fig. 3a) we show that our ability to predict the location of 
individuals has an upper bound of 71% which is significantly lower than the 93% reported 
by Song et al. |8j. The main reasons behind this discrepancy are discussed below. 
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S5.1.1 Difference in populations 

Our study population is comprised of university students that (1) not necessary have a 
single home/nightly location, (2) have a rich free time which they can utilize to explore 
new locations, and (3) have multiple work locations due to classes being distributed across 
a large university campus. 

In contrast, Song et al. studied a sample of 50 000 individuals selected from a total 
population of ~ 10 million anonymous mobile phone users. The users were chosen according 
to (1) their travel patterns, where individuals had to visit more than two cell tower locations 
during the observational period of three months and (2) their average call frequency which 
had to be > 0.5 hour -1 , effectively selecting individuals with at minimum of 12 calls per 
day. 


S5.1.2 Geospatial resolution 


Previous studies have applied call detail records (CDR) as proxy for location, inferring the 
position of individuals depending on which cell tower their mobile phone is connected to 
during a call |6, 8]j39j. While the granularity of cell tower locations in cities is around 800 
meters it can be on the order of kilometers in more rural areas 140 . Figure S24 illustrates 
the effect of using cell tower data for positioning, as it can cluster otherwise distinct places 
together as one. Our location data on the other hand has a typical accuracy below 60 
meters [10], enabling a more accurate spatial estimation. 



Figure S24: Cell tower resolution of a city. Otherwise distinct places can be grouped together 
under the same cell tower, coarse graining the geospatial position and increasing the probability of 
guessing correct. 


S5.1.3 Effects of binning 

Song et al. |8i| chose, because of data granularity, to segment their data into one hour 
time-windows. Since our data has finer granularity, down to the minute scale, we can choose 
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a different resolution, but which is optimal? 

We show in Fig. S25r that the finer we segment time, the better we are able to predict 
your location in the next time-bin. By reducing temporal bin size it is possible to achieve 
arbitrarily high levels predictability because segmenting data into finer time-windows 
increases the number of bins, which in turn leads to respective states obtaining a higher 
frequency of visits. 

The bin-size affects both temporal and uncorrelated entropy, however, so one hypothesis 
is that it is still meaningful to consider the ratio between the temporally uncorrelated 
entropy and the temporally correlated entropy. To investigate, w e stu dy on the ratio 
gtemp/gunc , evaluated across all individuals i. As is clear from Fig. S25 3 the ratio shifts 


towards lower values for small windows, revealing that the choice of bin width has a greater 
impact on temporal entropy. This suggests that predictability is greatly influenced by the 
choice of time-window, as has also been noted by 20,38 . In short, considering the ratio 


gtempigunc ^ oes no t so lve the binning problem. Ultimately this implies that the smaller 
bins we apply the better we are at predicting. Because we currently are unaware of any 
timescale that is fundamentally descriptive of human behavior, we chose to work with 
temporal sequences in their natural form instead of segmenting them, predicting ‘next state’ 
rather than ‘state of next time bin’ (see Fig. S26). 




Figure S25: Effects of binning on predictability bounds, a, Predictability as function of window 
size. Inset shows a close up for high values of predictability. Segmenting time into finer bins yields 
higher bounds of prediction, b, Ratio between temporal and uncorrelated entropy values. As the size 
of time-windows is narrowed the ratio shifts towards zero, indicating that S temp <C S unc . 


S5.2 Data for prediction 
S5.2.1 Social vocabulary 

We describe the social life of an individual based on the context provided by cores, where 
we focus on individuals that appear in frequently observed cores (observed, on average, at 
least once per month). In order to include the full social life of an individual we incorporate 
the context provided by dyads as well as the the information contained in infrequently 
observed cores. Note that if a core/dyad is only observed once then we denote it as a ‘noise’ 
state. In addition we construct one supplementary context: ‘alone’ denoting periods of 
time where an individual is not socially active. Following Song et al. j8] we construct a 
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time-series of social contexts for each user. However, as noted above, we do not segment 
the sequence into temporal bins, but keep it in its natural form, and predict ‘next state’ 
(illustrated in Fig. S26) for both geospatial and social prediction. 


Temporal sequence 


Vocabulary 


H-► t 


At 


Natural seq.^ > ■■■> C 2 , C 3 , C,, ... 

Segmented seq.(^> ■■■> C,, C 2 , C 2 , C 3 , C 3 , C 3 , C, 


Figure S26: Time-series of states. Left, part of the figure presents social states as they naturally 
occur in a temporal sequence, with At denoting a segmentation of the sequence within arbitrary sized 
time-windows. Right panel illustrates the difference in vocabularies, where the ‘natural’ sequence 
focuses on the order of states, while ‘segmented ’ also weights states according to their duration. 


S5.2.2 Location vocabulary 

In addition to social context, we also collect geographical traces for each user, enabling us 
to reconstruct their mobility patterns. To infer context from raw location traces we use 
the same definitions as Cuttone et al. (411, where a point of interest (POI) is a location of 
relevance for a person, such as home, work, or a cinema. POIs are inferred by applying a 
density based clustering algorithm (42], with a density grouping distance of 60 meters and 
requiring stops to consist of at least two samples, meaning that a person must have spent a 
minimum of 15 minutes in the same location. 


S5.2.3 Convergence of states 


Figure S27 shows the distribution of the number of distinct social and geospatial states for 


increasing windows of time. After 90 days both probability distributions converge, implying 
that the number of states visited by users is saturated, indicating that we can uncover a 
majority of states frequented by individuals. We, however, expect this saturation only to 


be meta-stable, because the social networks change across adulthood 43 


S5.3 Prediction 


Following Eq. |S13||S15 we first calculate the respective entropies of the behavioral patterns 


for each individual. We show in Fig. S28i that our social patterns have lower entropy than 
our mobility. The figure shows that an average person approximately occupies 2 2 ~ 4 social 
states and 2.5 2 ~ 6.25 location distinct states. We also find that humans are potentially 
more predictable based on their social contexts than their past locations, see Fig. S28 |j. 


Previous studies have found higher levels of predictability [8(9], see Sec. S5.1 for a full 
discussion. 


Fig. S29r shows the interrelation between Sfff ial and Sfffat ion , surprisingly there is no 


correlation between the two measures (Spearman correlation p = 0.053, p-value = 0.191), 
indicating that humans can be highly predictable in a social sense but very unpredictable 
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Figure S27: Distribution of the number of distinct states within a time window, a, Convergence of 
the number of social social states, showing saturation after a time window of 90 days, b. Saturation 
of visited locations, also convergence after 90 days. 


location-wise and vice versa. A similar lack of correlation is observed for S temp (Spearman 
correlation p = —0.008, p -value = 0.84), see Fig. 


S29 3. 




Entropy, S Predictability, n 


Figure S28: Probability distributions of entropy and predictability, a , Distributions of temporal 
and uncorrelated entropy for location and social behavior. As expected, temporal patterns contain 
more information than just frequency of visits, and hence have a lower entropy, b. Predictability 
distributions for uncorrelated and temporal patterns. On average, our social behavior is more 
predictable than our geospatial behavior. 


S5.4 Temporal aspect of predictability 


According to Fig. |S29 there is no correlation between social and geospatial aspects of 
human life, but this is measured in terms of overall dependence. In real life we have varying 
degrees of predictability, a simple way of visualizing this is to look at the number of states 
as function of time. According to Fig. S30r we have a low number of location states during 


nights (resulting in low entropy) because we mainly sleep at a single location, while during 
days and evenings our entropy is higher, in part because because we occupy more states. 
Note that there is very little overlap between locations visited in each 8-hour bin, e.g. 
morning locations are different from day locations. On Fridays we visit more locations than 


43 














3 



1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 
Location ( S un J 


b 



Location (S temp ) 


Figure S29: Correlation between social and location entropy, a, Mutual dependence between 
uncorrelated entropy for social and location states per individual, b, Correlation between temporal 
entropies for social and location states. 


any other day, while during weekends we are more stationary. If we, however, consider the 
total number of distinct visited places (Fig. S30 }d) we see that Fridays and Saturdays are 
special because those days are used to explore new locations. Therefore, predicting location 
during weekends based on routine is more difficult, since we have higher entropy during 
these periods. Our social behavior (Fig. S30| ;) resembles our mobility, where we socialize 
mainly during the day and less during the night. Weekends are again special, interestingly 
we here observe a drop in in the number of social states, because we are not required to 
go to work or school. Fig. S30 I shows that the number of social states decreases during 
weekends, meaning our participants reserve weekends to socialize with a few selected friends. 



Figure S30: Nested histograms showing the temporal aspects of predictability. Binned using daily 
and 8-hour intervals (12 am - 8 am, 8 am - 4 pm, f pm -12 am), outer bars (gray) denote days 
while inner bars (white) denote 8-hour windows. Bars do not necessary add up, because one can have 
an overlap of states between the 8-hour bins. All values are averaged across the student population, 
a, Number of average observed locations per bin. b, Total number of visited distinct locations, c, 
Average number of social states per time-bin. d, Total number of distinct social states. 


Here, it is important to realize that we only observe social interactions among participants 
of the experiment. Wile the geospatial data is sampled evenly over the observation period, 
social interactions might have a potential bias, as it is possible to go out with non-university 
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friends on Fridays and weekends. We address this issue, in Fig. 5b (main manuscript), by 
quantifying the geo-spatial behavior of cores. Whenever a core is present we pool together 
the geographic locations of its individuals, bin the resulting list of (latitude, longitude)- 
points in 0.01 x 0.01 cells, and quantify the behavior using uncorrelated entropy. Using 0.1 
x 0.1 and 0.001 x 0.001 cells produces comparative results. Displayed in the figure is the 
average entropy, averaged over all cores and binned in 8-hours bins (as described above). 


S6 Social prediction 


It has previously been shown that spatial behavior between individuals that share a social 
tie is correlated 15-18 . But the onset of co-presence lacks a temporal signature, so while 


the spatial traces overlap it is not generally possible to specify exactly when a friend is 
predictive for an individual’s behavior. Cores, however, do provide such context. An 
incomplete set of members provides a clue that a social interaction is about to occur (i.e. the 
final group member is about to arrive). 

We test this concept on cores of size three; thus provided we observe two members 
we measure the probability of the last member arriving within the next hour. To avoid 
testing the hypothesis on scheduled meetings we focus on weekday nights (6 pm - 8 am) 
and weekends. This is the period where routine driven prediction is at its weakest. Further, 
in order to avoid circularity, we evaluate the hypothesis on a test month (May 2014), during 
which we have not identified gatherings. 


S6.1 Null models 


For evaluation purposes we compare cores to two reference models, both generated from 
real world data. We segment interactions into undirected and unweighted daily graphs, 
see Fig. S31 1 . The first null model, we which we denote random , constructs reference 
groups by randomly drawing nodes from each daily graph. The second model utilizes a 
breadth-first search in the daily graph to create reference groups starting from a randomly 
chosen seed node by searching its local neighborhood. A new seed node is chosen if the 
search is restricted to components with fewer than three members. BFS is a strict null 
model and requires all nodes to have shared a physical interaction. We disregard reference 
groups if they happen to be identical to a core. 


S6.2 Comparison 


We test the hypothesis on a sample of 340 frequently occurring cores and 10 000 reference 
groups for n = 100 independent trials. For 33 cores we never observe an incomplete set of 
members during the month of May, therefore they cannot be used for prediction and are 
disregarded. 

As shown in Fig. S31 3 approximately 50% of cores are predictive, in stark comparison 
to BFS reference groups, with the random null model performing even worse. 
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Figure S31: Social prediction, a. Daily graph of interactions, illustrating cores and construction of 
null models, b Percentage of socially predictive cores within each category. A group is predictive if it 
at least once has correctly predicted the arrival of the missing individual. For the reference models 
errorbars are calculated across n = 100 independent trials. 


S7 A dynamic random geometric graph model 

In order to investigate the statistical significance of the results found in the empirical data, 
we create a null model to simulate full network dynamics. 

The simplest candidate for a model system is based on random geometric graphs 
(RGGs) |44 which incorporate random walks. We model a scenario where individuals move 
randomly and connect to each other when they are physically proximate. This model has 
three parameters. The random walker step size £. population density p and contact radius r 
(of which only two are free after rescaling). In the current setting, it is natural to choose a 
scale for this model that lets us interpret the results in units that correspond to our actual 
real-world system, thus we connect two links if their distance is less than r = 10 meters, 
corresponding to the typical Bluetooth range used in the main text. 


S7.1 Single time-slice RGG parameters 


Random geometric graphs have been used as models for complex networks in the literature, 


see Ref. 45 for an overview. In a RGG, we expect to find a single giant connected component 


when r is much greater than the typical distance between neighboring nodes. When r is 
much smaller than the typical distance between neighboring nodes, the random geometric 
graph has many isolated components with the component size distribution approximately 
exponential [46 . Near the critical threshold r c , the distribution of component sizes follows 
a power-law 1461. 

For our purposes, RGGs are interesting as a model for our real-world system around 
r c because gatherings in the real-world network display a power-law distribution of sizes, 


as suggested by the distribution of gathering sizes displayed in Figure |S9| Staying with 
parameters chosen to correspond to the empirical data, we construct a random geometric 
graph with n = 800 nodes distributed randomly in a Lx L square, corresponding to density 
p = n/L 2 , see Fig. S32 4 for an illustration, to remain consistent with the literature, we use 


open boundary conditions when forming components [46] . 

Using the interpretable ‘physical’ parameter choices described above, we can estimate 
p based on the critical threshold for RGGs reported in the literature j46j, 47). This yields 
p = 0.014 nodes/m 2 , in accordance with our numerical simulations, see Fig. S32]3 (inset 1). 
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Figure S32: The RGG model (parameters as described in the main text). A. A snapshot of the 
model near the percolation threshold; notice how components do not form dense clique-like structures. 
B. The normalized distribution of component sizes in many realizations of the RGG model (red) and 
as observed in the empirical data (black). When the random geometric graph is near the percolation 
threshold, both systems display power distributions of component sizes. Inset (1) shows the average 
largest Connected Component size (normalized to total number of nodes) as a function of density. 
Inset (2) shows the real and model component size distribution at the critical point. C. The Average 
shortest path length inside components in the RGG (red). Shaded areas denote standard deviation. 
The Average shortest path length inside components in real data (black). Notice how the average 
shortest path length tends to be significantly larger in the sparse, spatially extended components 
generated by the RGG model. 


The resulting distribution is shown in Fig. S32p (inset 2). At the critical point, the 
distribution of component sizes in the RGG model is broader than the distribution found in 
real-world data and with a noticeable cut-off due to the finite size effects. By reducing the 
model’s density by a small amount (to p ~ 0.01), we find that the distribution of component 
sizes in the RGG model can be tuned to match the real-world component size distribution 
reasonably well. (Details on distribution matching can be found below.) The final single 
time-slice RGG component size distribution for the ‘physical’ parameters defined above is 
shown alongside its real-world counterpart in the main panel of Fig. S32[l 


S7.2 Model versus the real world 

While the distributions of component sizes in the model is qualitatively similar to what we 
find in the real data, it is important for our understanding of the model’s limitations to 
point out that there are significant differences between the RGG and real-world data. 

Firstly, the spatial distribution of node-locations is very different, from p = 0.01 nodes/m 2 
and n = 800 nodes and resulting L ~ 283 m. In contrast, the real data-set finds individuals 
distributed in small clusters throughout the entire greater Copenhagen region (L rea i ~ 20 000 
meters, cf. Ref [l0|). 

Secondly, while both the real data and the RGGs have power law distributed component 
sizes, see Figure [S32| 3, the internal structure of components in the two models are quite 
different. In Figure [S32| A we show a realization of a RGG for the final parameter values 
discussed above. Nodes that are within a radius of r = 10 m are connected by a link. While 


more clustered than components in a Erdos-Renyi graph 46 , the component we find in the 
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Figure S33: A. Distribution of gathering lifetimes for parameter values defined in the text. B A 
example of participation in cores in the real world data (from main text Fig. 3d). C An example 
of participation in cores in the dynamic RGG model. Notice how the dynamic RGG does not have 
recurring gatherings. 


RGG model are not clique-like as we observe in the empirical data (cf. main text Figure 1). 
Instead of being clique-like, the RGG components have long average internal paths and 
significant spatial extension (the distance between extremal points often representing a 
significant fraction of the side-length L). One way of quantifying the differences between 
the empirical components and the RGG components is considering the average shortest 
path length inside components in the two models. This is what we explore in Figure [S32p , 
where we quantify how paths are significantly longer in the random geometric graph than 
what we observe in the more clique-like components observed in the empirical data. 

S7.3 A dynamic RGG model 

So far we have only considered static random geometric graphs—describing how to generate 
a single time-slice with statistics that match our real-world data. We now implement 
graph dynamics by letting each node move according to a random walk; we use periodic 
boundary conditions for the random walkers. We implement the random walk as follows. 
At time step t, node i with location (xi(t), yi{t)) updates its position by selecting an angle 
6i(t) E [0,27r) at random and moving with step length l in that direction, ending up in 
(xi(t + l),yi(t + 1)) = (xi(t) + ^cos [0i(t)],yi(t) +i'sin[0j(t)]), modulo the periodic boundary 
conditions. 

We choose step size £ to produce gathering lifetimes corresponding to the empirical 
distribution, given the choices of values for n, r, p listed above. We match up the two 
distributions by iterating over values of £ and pick the step size which minimizes the 
Kullback-Leibler (KL) divergence 1481 between the two probability distributions. We 
find that £ = 0.65 (i.e. 0.65m in the physical interpretation) produces lifetimes with a 
distribution of gathering lifetimes similar to that of the empirical system, see Figure [S33| A 
for the two distributions. 

Thus, quite remarkably, the very simple dynamic RGG model based on random walks for 
each node is able to reproduce, not only the the distribution of gathering sizes (Figure[S32f3), 
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but also the distribution of gathering lifetimes. This implies that in terms of gatherings 
this very simple model is able to produce behavior quite similar to what we observe in the 
empirical data—at least in terms of distributions. 

S7.4 Random walks in 2D 

Let us quickly recapitulate the central results on random walks in 2EQ The first key result 
is that the the average displacement of a random walker is zero. This result simply means 
that if we choose a point (xo,yo) and start an ensemble of random walkers there, each one 
moving according to the rules outlined above, their average location will remain (a?o>2/o) 
independent of the number of steps. The second result is that after T steps, the root mean 
square displacement d ims of a walker with stepsize £ is d rms = l\[T. 

Taken together these two results characterize the behavior of individual random walkers. 
A walker does not remain at its starting point, but tends to drift, on average traveling 
some distance d in T ~ ( d/t ) 2 steps. The fact that the average displacement is zero simply 
implies that a walker is equally likely to drift in any direction. Thus if we find multiple 
walkers close to each other—as is the case for a gathering in our model—we expect the 
gathering to disappear after some time T, as the members each drift in a random direction. 


S7.5 The dynamic RGG model does not generate recurring meetings 


So far, we have shown that the dynamic RGG model is able to reproduce the distributional 
properties of our real-world data. That is, for a single time-slice, the model can generate 
component size distributions that are quite similar to what we observe in the empirical data. 
When we add a dynamic component to the model, allowing each node to move according to 
a random work, we see that the real-world distribution of gathering life-times can also be 
matched by this simple model. 

The dynamic RGG model, however, does not generate core^J Recall that cores are 
recurring gatherings, where a specific group of individuals meets repeatedly over time. Given 
the known dynamics of random walkers summarized above, the long-term picture finds 
each random walker drifting according to its own trajectory. Since there are no correlations 
between individual trajectories, the probability that the members of any gathering with 
more than two nodes will be in close proximity again, on the time-scale we are studying 
here (e.g. simulating ~ one month of data with parameters as above), is vanishing. Thus, 
we do not expect to see cores occurring more than once in the dynamic RGG model. 

This is precisely what we observe when we analyze the model data. Figure S33P is 
a similar to the main text’s Figure 3d, but this time showing cores with any number of 
occurrences (not just the recurring gatherings), displaying how a representative individual 
participates in her/his cores as a function of time. Time runs along the .T-axis and each 
vertical line represents an activation of that core. In the main text, we call this the path 


‘Random walks in 2D are well understood, see e.g. littp://mathworld.wolfram.com/RandomWalk2- 
Dimensional.html/ for a nice overview 

1 Technically, a given group of people only meeting a single time is both a gathering and a core (a core 
which occurs only once). Here, we use core in the sense that we do in the main paper—to signify recurring 
gatherings consist of 3 or more individuals and meet more than once per month on average 
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through the cores an individual’s ‘social trajectory’. In Figure S33P , we have generated a 
social trajectory for a representative node in the dynamic RGG model. The differences in 
core participation patterns illustrates the argument above nicely. Over the same time-period, 
the dynamic RGG model-node in question participates in nearly a similar number of social 
contexts, but all of them occurring only once; sometimes gatherings in the model can last 
multiple days. In the real-world data, however, a pattern of recurring meetings is evident. 
It is based on these patterns that we use the cores to make predictions. 

Thus, it is clear that due to the lack of recurring cores, the dynamic RGG model does 
not have predictability in the senses that we discuss in the main text. Let us be more 
explicit about this point. Recall that in the main text, we define predictability in two senses. 
Below, we discuss each one in detail. 


• Firstly, we discuss what we call the ‘social unit property’. The basic idea behind 
the social unit property is that, cores represent meetings between n > 2 individuals, 
characterized by the fact that all n members are usually present when the core is 
active. Therefore an observation of an incomplete set of core members implies that 
the remaining members will arrive shortly, a fact which can be used for prediction. It 
is trivial to see that cores in the dynamic RGG model do not possess the social unit 
property, simply because cores typically only occur once. 


• Secondly, we measure predictability as defined in Ref. [8| (see main text for details). 
The calculation for predictability in this sense depends on compressing the social 
trajectory. As is clear from Figure |S33P , the social trajectory for a node in the 
dynamic RGG model simply grows over time, continually introducing new gatherings 
that only occur once. This implies that the entropy for a node continues to grow 
as a function of time. The fact that the entropy keeps growing should not surprise 
us, given that social structures arising in the simple RGG model are based on a 
random process. The fact that the entropy is unbounded implies that there is zero 
predictability in the Ref. [8] sense as well. 


As noted earlier, RGG’s have been used as models for social systems in the literature. 
Specifically, Refs. [5,49 have explored modeling social networks using variations of the 
simple RGG model explored above, also finding that distributional properties of empirical 
data can be matched quite well. In both cases the walkers ‘slow down’ (decrease step 
size) upon forming links with other agents. This social slow-down results in denser (more 
realistic) gatherings, but due to the fundamentally random nature of walkers in each model, 
neither of these model display cores that appear multiple times. Thus, just like the simple 
dynamic RGG, these models do a surprisingly good job of simulating behavioral data for a 
single day (creates gatherings), but none of them capture behavior across weeks and months 
(unable to create realistic cores). 


S7.6 Summary 

In summary, the random geometric graph with random walkers provides an interesting 
model which is able to reproduces distributional properties of component sizes as well 
as component life-times quite accurately. Therefore this model is an excellent point of 
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comparison, when we want to understand the significance of our findings regarding the 
empirical dynamic networks. 

There are some important differences between the model and the empirical data. When 
we look at the RGG components, for example, we find that their spatial configuration is 
quite different from what we observe in real data. In order to reproduce the distributional 
properties, all model components must be packed in a very small area (a square with side 
length L ~ 280 meters) and the model component are sparser with much longer path-lengths 
than components in the real-world data. 

The most important difference between the empirical data and the simple RGG model, 
however, is that the latter does not display any meaningful core structure. As we have 
shown above, this implies that the predictability we observe in the empirical dataset is not 
replicated by the dynamic RGG model and can be considered a real effect expressed in the 
data. 


S8 Coordination of meetings 


This section describes how we calculate the amount of coordination leading up to gatherings. 
Prior to a meeting individuals might need to coordinate about when and where to meet. 
This coordination can be conveyed through various means: (1) individuals can organize in 
real time through electronic means, such as online social networks and mobile phones, (2) 
they can verbally have scheduled meetings beforehand, i.e. at previous gatherings, (3) or 
attend routine driven pre-scheduled meetings, arranged by an institution, e.g. the university. 
We consider the coordination in the hours leading up to a meeting, measured in terms of 
increased calling and texting activity. Because calling frequencies change over the course of 


a day, and because individuals can have fundamentally distinct calling patterns 110.50 51 


we compare activity leading up to a meeting to hour-by-hour dynamic individual baselines, 
defined as 


c t = 


1 N 


(S16) 


where N it the number of individuals participating in the gathering, af is the activity of 
person n, and ajf is the baseline activity. The equation denotes increased coordination 
levels t hours before a meeting. Because gatherings have a broad distribution of lifetimes 


(Fig. S9), we restrict the calculation to individuals that participate in the first hour of the 


meeting. According to Fig. |S34 r-b, meetings during the weekend require more coordination 
than meeting during weekdays. This implies that weekend behavior is less scheduled, 
emphasizing the problem of predicting behavior using traditional routine-based measures. 
In addition, Fig. S34: reveals that meetings, independently of size, require the same amount 


of coordination per person, illustrating the validity of Fig. |S34r for varying gathering sizes. 
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Figure S34: Increased coordination prior to a meeting. Calculated for all nodes participating in 
the first hour of a gathering, a, Required amounts of coordination between nodes, depending on 
when the gatherings meets. More coordination is required to organize meetings during weekends 
(Friday 4 pm - Sunday) than during weekdays (Monday - Friday 4 pm), b, Further sub-dividing the 
categories from panel a reveals that Fridays and Saturdays are special, c, Effect of size of the meeting 
on coordination. On average it requires equal amounts of coordination, per person, to organize a 
meeting, independent of the size of the group. 
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